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Abstract 



We study a wireless broadcast network, where a single source reliably communicates 
independent messages to multiple destinations, with the aid of relays and cooperation 
between destinations. The wireless nature of the medium is captured by the broadcast 
nature of transmissions as well as the superposition of all transmit signals plus inde- 
pendent Gaussian noise at the received signal at any radio. We propose a scheme that 
can achieve rate tuples within a constant gap away from the cut-set bound, where the 
constant is independent of channel coefficients and power constraints. 

The proposed scheme operates in two steps. The inner code, in which the relays 
perform a quantize-and-encode operation, is constructed by lifting a scheme designed 
for a corresponding discrete superposition network. The outer code is a Marton code 
for the non-Gaussian vector broadcast channel induced by the relaying scheme, and is 
constructed by adopting a "receiver- centric" viewpoint. 

1 Introduction 

The scenario of study in this paper is depicted in Figure HJ A single source node is reliably 
communicating independent messages to multiple destination nodes using the help of multiple 
relay nodes. In the example of a cellular system, the setting represents downlink communi- 
cation where the base-station is transmitting to multiple terminals with the potential help 
of relay stations. Note that some of the terminals can themselves act as relays. A special 
instance of our setting is the following: only the source node and multiple destinations (i.e., 
no relays) are present. Since we have allowed the ability to transmit and receive at all nodes, 
this special instance models the downlink of a cellular system with the destinations having 
the ability to cooperate among themselves, which has been studied in [9] . 

We consider the canonical Gaussian channel model among the various nodes in the network: 
time is discrete and synchronized among all the nodes. Denoting the baseband transmit 
symbol (a complex number) of node k at time m by £fc[m], the average transmit power 
constraint at each node implies that, 
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Figure 1: A wireless broadcast network 



where T is the time period over which the communication occurs. At each time m, we have 
the received signal at any node I 

yd m ] = h ki[m]x k [m\ + Z([m\. (2) 

k^t 

Here {^[^]} m is i.i.d. Gaussian noise and independent across the different nodes t. The 
channel attenuation between a pair of nodes (k, £) is supposed to be constant over the 
time scale of communication. Note that by normalizing the channel attenuation hki, without 
loss of generality, we will assume unit average power constraints at each node, i.e. Pk = 1 and 
also the variance of zg[m\ to be 1. We suppose full duplex mode of operation for the most 
part, while discussing the implications of half duplex mode later in the paper. We suppose 
single antenna at each node and leave the discussion with multiple antennas for a later part 
of the paper. We will begin with the supposition that these channel attenuations are known 
to all the nodes in the network, and revisit this requirement later. 

Let V denote the set of all nodes in the network. A (2 TRl , . . . ,2 TRj ,T) coding scheme 
for the broadcast network, with source node S and destination nodes Di, . . . , Dj, which 
communicates over T time instants is comprised of the following 

1. Independent random variables Wi which are distributed uniformly on [2 TRi ] for i = 
1, . . . , J respectively. Wi denotes the message intended for destination Di. 

2. The source mapping, 

fs : (Wi x . . . x Wj) -+ Xl (3) 

3. The relay mappings for each v G V\ {S} and t G [T], 

ht ■ yir 1 -> Xv. (4) 
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4. The decoding map at destination D i} 

g Di : y T Di -> Wi. (5) 
The probability of error for destination i under this coding scheme is given by 

PI = Prim^Wi}. (6) 

A rate tuple (Ri, R2, Rj), where Ri is the rate of communication in bits per unit time for 
destination Di, is said to be achievable if for any e > 0, there exists a (2 TRl , 2 TR2 , 2 TRj , T) 
scheme that achieves a probability of error lesser than e for all nodes, i.e., maxj P % e < e. The 
capacity region C is the set of all achievable rates. 

The following is the well known cut-set upper bound to the rate tuples of reliable com- 
munication [5j Hj: denoting the set of all nodes by V; and for all subsets J C [J], where [J] 
denotes the set {1, . . . , J}, denoting Aj to be the collection of all subsets fl C V such that 
the source nodes S G f2 and a subset J C [1 : J] of destinations T>j G f2 c ; we have that if 
(i?i, . . . ,Rj) is achievable, VJ7", there is a joint distribution p({X v \v G V}) (denoted by Q) 
such that 

Rj < Cj{Q) = min I (Y Q c; X n \X n .) , (7) 

where Rj d = ]C iG j ^j- 

Let C(Q) denote the set of all rate tuples that satisfy the cut-set upper bound for a given 
joint distribution Q, and C denote the cut-set bound: 

C(Q) = {(Rx, Rj) : Rj < Cj(Q) VJ C [1 : J]} (8) 
C d = f conv I |J C(Q) J , (9) 

\{Q:E|X„| 2 <1} / 

where conv (•) denotes the convex hull of the region. 
Our main result is the following. 

Theorem 1. For the wireless broadcast network, a rate vector (R\, . . . , Rj) is achievable if 

+ k, . . . , Rj + k) E C (10) 

for some constant k, which depends only on the number of nodes, and not on the channel 
coefficients, and k = 0(| V| log |V|). 

With a single destination node (J = 1), the scenario reduces to the classical Gaussian 
(unicast) relay channel. Pioneering work of pQ has obtained an approximate characterization 
of the capacity for this scenario. In recent work, [31 Ej derive similar approximation results 
with different coding schemes. In particular, [TJ [3] take a two-step approach in their coding 
scheme: first, they develop deterministic models that approximate the Gaussian channel and 
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next, they construct the codes for the Gaussian channel based on the codes for the corre- 
sponding deterministic channel approximation. The specific approaches adopted in the actual 
deterministic approximation and moving from the codes for the deterministic channel to the 
Gaussian one are different between [1] and [3]; as such, we follow the approach of [3] in our 
proof of the main result. 

The proposed scheme operates in two steps. The inner code, in which the relays essentially 
perform a quantize-and-encode operation, is constructed by lifting a scheme designed for 
a corresponding discrete superposition network. This induces a vector broadcast channel 
between 

The outer code is essentially a Marton code ([7J E\) for the broadcast channel induced by 
the relaying scheme, and is constructed based on a "receiver-centric" viewpoint. 

The rest of the paper is organized as follows. In Section [2J we give a coding scheme and 
establish an achievable rate region for deterministic broadcast networks. In Section [3J we prove 
Theorem [T] by giving a coding scheme for the wireless broadcast network. In order to do so, 
we use the discrete superposition network, which is an approximation to the wireless network, 
and the "lift" the scheme from the discrete superposition network to the Gaussian network. 
In Section HI we discuss various aspects of the proposed scheme, primarily the reciprocity in 
the context of linear deterministic and Gaussian networks, and the channel state information 
required. In Section [5j various generalizations of the scheme are provided, for half-duplex 
networks, for networks with multiple antenna and for broadcast wireless networks, where some 
set of nodes demand the same information and other nodes demand independent information. 

2 Deterministic Broadcast Networks 

In the deterministic network model, the received signal at each node is a deterministic function 
of the received signals. 



The input and output alphabet sets, A^'s and JVs respectively, are assumed to be finite sets. 

As before, we have the following cut-set upper bound to the rate tuples of reliable com- 
munication [SJ H]: if (Ri, . . . , Rj) is achievable, then \/J C [1 : J], there is a joint distribution 
p({X v \v G V}) (denoted by Q) such that 



We prove the following achievability result for the deterministic channel. 

Theorem 2. For the deterministic broadcast network, a rate vector (Ri, . . . , Rj) is achievable 
ify J , there is a product distribution Y[ p(X v ) (denoted by Q p ) such that 
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Figure 2: A Layered broadcast-relay network 



Remark 1. Aref networks, i.e., deterministic broadcast networks where each node can receive 
information from every incoming edge separately, were studied in [2^]. It is shown there 
that cut-set bound can be achieved for these networks. This result can also be recovered from 
Theorem^ by observing that product form distributions optimize the cut-set bound. It should 
be noted, however, that the scheme proposed in \2J$ is a separation-based scheme whereas the 
scheme proposed here is not. 

Proof. We prove Theorem [2] for the layered network here. The arguments can then be extended 
to the general network by using time expansion as done in [I]. A network is called a L-layered 
network if the set of vertices V can be partitioned into L disjoint sets, such that only the source 
node S is in the first layer and the J destination nodes are in the L-th layer. The nodes in 
the intermediate layers are relaying nodes. The received signal at the nodes in the I + l-th 
layer only depend on the transmitted signals at the nodes in the Z-th layer. This dependency 
is often represented by edges connecting the nodes from the Z-th layer to the (I + l)-th layer. 
An example of a layered broadcast network is shown in Fig. [2j The advantage of working with 
a layered network is that we can view the information as propagating from one layer to the 
next without getting intertwined. 



2.1 Outline of Coding Scheme 




Figure 3: Coding scheme 

The coding scheme operates over three levels of nested blocks as shown in Figure [3j If T 
is the total time period of communication, then T = T1T2T3. The innermost level is level-1 
and consists of Ti time instants. T 2 such level-1 blocks constitutes a level-2 block. The overall 
coding scheme operates over T 3 level-2 blocks. Our coding scheme comprises of the following: 
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• The message Wi is broken into T 3 independent sub- messages Wi(l),. . . , Wi(T 3 ), where 
each sub-message is encoded over a particular level-2 block. 

• The relay mappings are done at the level-1 block. Every node blocks up T\ received 
symbols and maps it to T\ transmit symbols. This relay mapping is fixed for the duration 
of a level-2 block and it essentially creates an end-to-end deterministic broadcast channel 
at the level-2 block. 

• The encoding at the source is done on the induced end-to-end deterministic broadcast 
channel for each level-2 block. Across different level-2 blocks, the mappings are generated 
in an i.i.d. manner and corresponding broadcast schemes are used for the different end- 
to-end broadcast channel induced by this operation. This is very much like a fading 
broadcast channel. Here the random fading is introduced by the relay mappings. This 
is merely a proof technique which allows us to average the performance over random 
relay mappings. 

• The destinations decode the sub-message corresponding to each level-2 independently 
and sequentially. 

2.2 Coding Scheme in Detail 

Throughout the discussion below, we fix a particular product distribution Q p , which is then 
used to describe a random ensemble of coding operations. 

2.2.1 Relay mappings 

In the proposed scheme, the relays operate over level-1 blocks, i.e., each relay transmits in a 
level-1 block using only the information from the previous received level-1 block. We will use 
x r = f xj 1 and y r = f yj 1 to denote the transmit and receive block at any relay node r £ Vr, 
where Vr is the set of all relay nodes. The mapping at the relay node denoted by 

fr(h) -.yj 1 ^Xj\ (14) 

is random and generated i.i.d. from the distribution p(X r ), i.e., V^ 1 £ yj 1 , generate xj 1 
i.i.d. from p(X r ). Note that the relay mapping is only a function of £3 i.e., it is fixed across 
all level-1 blocks in a given level-2 block. Across different level-2 blocks, i.e., for each i 3 , it 
is generated in an i.i.d. manner. As mentioned earlier, this induces an end-to-end broadcast 
channel as shown in Figure HI a different one across every level-2 block. 

2.2.2 Source Mappings 

The capacity of the deterministic broadcast channel is well known ([7J HQ]). For our coding 
scheme over the induced deterministic broadcast channel of Figure HJ we use the coding scheme 
similar to the one described for the deterministic broadcast channel in [8], which we refer to 
henceforth as the "Marton code". The source codebook for the 23-th level-2 block, which maps 
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Figure 4: Effective end-2-end deterministic broadcast channel created by a level-1 code. 

the message Wi(t 3 ) £ [2 TlT2Ri ^],i = 1,2,..., J to transmit symbol block x^ 2 , is described 
below. 

Given the random vector X s which is distributed as p{Xs) = YIp(Xs), the channel and 
the relay mapping induce the joint distribution over the random variables ( Xs, Yv ) ■ Create 



auxiliary random variables such that pg fi D fj D ff D is the same as p% f D f o f D . 

The set Tj 2 (UdJ of all typical u^. are binned into 2 TlT2Ri ^ bins, where each bin index cor- 
responds to a message, for i — 1, 2, J. For each vector {u^ t , ■ ■ ■ , u^ r ) £ TJ 2 ^Yd x , ■ ■ ■ , Ydj), 
there exists a sequence ^(w^ 2 , . . . , u^ 3 ), since the channel is deterministic, such that (xg 2 , u^ v 

T§' 2 {Xs,Yd 1 , ■ ■ ■ ,Ydj)- This specifies the codebook for the given level-2 block £ 3 . Similar 
codebooks are generated, statistically independently, for all the T 3 level-2 blocks. 

2.2.3 Encoding 

The messages W { £ [2 TlT2TaRi ] are split into sub-messages Wi(t 3 ) £ [2 TlT2jRl( * 3) ] such that 

J2Mt3)=RiT 3 . (15) 

t 3 

For the t 3 -th level-2 block, the messages to be transmitted are given by (Wi(t$), . . . , Wj(t 3 )) 
for the J destinations respectively. To transmit the message, the source looks at the codebook 
for the level-2 block t 3 and tries to find a vector (uj 2 , . . . , u 7 } 2 ) £ T^ 2 (U±, . . . , Uj) such that 
uj 2 is also in the bin with index Wi{t%). If the source can find such a vector, it transmits 
x^ 2 (uj 2 , . . . ,u T j 2 ). If the source cannot find such a sequence it transmits a random sequence. 

2.2.4 Decoding 

At the end of £ 3 -th level-2 block, the destination D{ decodes the transmitted message Wi(t 3 ). 
The destination tries to find the bin in which the received level-2 block y 7 ^, (t 3 ) falls and decodes 
that bin index as the transmitted message. 
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2.3 Performance Analysis 

We begin with identifying rate constraints for the t3-th level-2 block, so that arbitrarily small 
probability of error can be achieved for decoding the message at this block. From the coding 
theorem for the deterministic broadcast channel ([7], Theorem 3), we know that as long as 
Ri(t 3 ) satisfies, for i — 1, . . . , J and VJ7" C {1, . . . , J}, 



Rj(h) < ^H{Y Dj \F Vn = / VK (t 3 )), (16) 
J i 



the probability of error can be made arbitrarily small by choosing a large enough T2. The 
overall rate Ri is given by . Therefore the rate tuple (Ri, . . . , Rj) satisfies 

Rj<j^rjt H (Y Dj \F Vn = /v*(ts)), V J C {1, . . . , J} . (17) 
By the strong law of large numbers, as T 3 — > 00, we have 

-±-f^H(Y Dj \F Vn = f Vn (t 3 )) a 4 ±EH(Y Dj \F Vn = f Vn (t 3 )) 

*3=1 

= ±H(Y Dj \F Vn ). (18) 
J 1 

Next, we relate the expression in ( fl8l) to the cut-sets using the following lemma: 
Lemma 1. Given arbitrary e > 0, 3 T\ s.t., 

H(Y Dj \F Vr ) = H(Y^\F Vr ) > T^CjiQ,) - e), (19) 

VJC{1,...,J}. 

Proof. See Appendix lAl □ 
Using ( ITTj) . (Tl8l) and Lemma (TJ we conclude that for any rate tuple satisfying 

Rj < Cj(Q p ) - e, (20) 

arbitrarily small probability of error is achieved. The proof is finally completed by allowing 
time-sharing between the coding schemes for all product distributions Q p . □ 

3 Gaussian Broadcast Networks (Proof of Theorem d) 

3.1 Layered Network 

As for the deterministic network, we will first consider a layered network. For the deterministic 
network, we first used the random relay mappings to create an effective vector broadcast 
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channel. Then we used the Marton scheme with a specific choice of auxiliary random variables, 
which is optimal for the deterministic broadcast channel. For the Gaussian network, while it 
is possible to do the inner code similarly and induce a broadcast channel, this is a vector non- 
Gaussian broadcast channel for which it is unknown whether a Marton scheme can achieve 
any rate within a constant gap of the cut-set bound. To deal with this issue, we convert the 
Gaussian network into a deterministic network, for which we can design the germane code 
and then appropriately "lift" the code from the deterministic to the Gaussian network. 
The procedure is outlined as follows: 

1. Given the Gaussian broadcast network, we construct a corresponding deterministic su- 
perposition network (DSN). The cut-set bound of the DSN approximates the cut-set 
bound of the corresponding Gaussian network to within a factor N log N. Further, the 
DSN is deterministic and thus the scheme in Theorem [2] can be used for the DSN to 
achieve the cut-set bound evaluated under product-form distributions. The details are 
provided in Sec 13.1.11 

2. We then prune a natural coding scheme V{k) for the DSN, such that the rate is reduced 
by a factor Nn. The details are in Sec. 13.1.21 

3. Finally, we show that for an appropriate choice of k = O(logiV), the pruned coding 
scheme on the DSN can be emulated on the Gaussian network, because each node in 
the Gaussian network can decode the corresponding received vector in the DSN. The 
details are given in Sec. 13.1.31 Therefore, the rate achieved in the Gaussian network by 
the proposed scheme achieves within a gap of 0(N log N) of the cut-set bound. 

3.1.1 Discrete Superposition Network (DSN) 

Given a Gaussian network, we construct a discrete superposition network. This network is 
essentially a truncated noiseless version of the Gaussian model. Further, the input in this 
model is restricted to a finite set. 

Corresponding to the channel model for the Gaussian network given by (J2]), the received 
signal in the DSN is given by 



where [•] lies in Z+zZ and corresponds to rounding the real and imaginary parts of the complex 
number to the nearest integer. The transmit symbols x^frn] are restricted to a discrete and 
finite complex valued set. This defines the DSN. Note that our model is very similar to the 
truncated model in [1] and to the model described in [3]. 

Next, the following lemma relates the cut-set upper bound of the Gaussian network, C Ganss , 
to the cut-set of the DSN under product distribution. 

Lemma 2. There exists a Q p for the DSN such that 




(21) 



C 



Gauss 



QC DSN (Q p ) + h(l,l,...,l), 



(22) 
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where k 2 = 0(|V|). 



Proof. See Appendix [Dl □ 

Note that since the DSN is a deterministic network, we have the following corollary of 
Theorem |2J 

Corollary 1. For the DSN, a rate vector (Ri,...,Rj) is achievable if there is a product 
distribution Q p such that, V J , 

Rj < C§ SN (Q P ). (23) 

We observe that, here Q p is product distribution over the finite input alphabet set of the 
DSN. 



3.1.2 Pruned Coding Scheme V(k) for the DSN 

The pruned coding scheme described next is along the lines of the ideas developed in [3J. 
The main idea there is that a coding scheme from the DSN can be used in the Gaussian 
network, if at every node, the received vector at any node in the DSN can be decoded from 
the corresponding received vector in the Gaussian network. This will not, in general, be true 
for any scheme in the DSN. Therefore, to get the source codebook for the Gaussian network, 
the codewords transmitted by the source in the DSN are pruned to a smaller set in such a way 
that the received vector at any node in the DSN can be decoded from the received vector in 
the Gaussian network. In this section, we will demonstrate how to construct a pruned coding 
scheme V(k), for which the rate is reduced by a constant Nk. In Sec. I3.1.3[ we will show how 
to choose the parameter k such that the Gaussian network can emulate the DSN. 

Our coding scheme for the deterministic network described in Section 12.21 involved three 
levels. Level-1 involved relay mappings over a level-1 block of T\ time symbols defined by 
fy a . Then the level- 2 code involved a Marton scheme for this fixed fy R . This code was over a 
level- 2 block, which comprised of T 2 level-1 blocks. Then at level-3, we repeated this Marton 
code over T 3 level- 2 blocks, where for each super-block the relay mapping fy was generated 
i.i.d. using Fy. For the Gaussian network, we will maintain the same level-1 and level-?) codes, 
but will modify the Marton code by pruning the set of codewords transmitted by the source. 
This is described in detail next. 

Pruning the Level-2 (Marton) code: We now specify the operation during the ^-th level-2 
block. In this level-2 block we need to send the messages Wi G [2 TlTa ^^)] at rates Ri(t 3 ), 
for % — 1, . . . , J. As before, since the the relays maintain the same mapping fy R {t%) for the 
duration of this level-2 block, we have an effective end-to-end deterministic channel as shown 
in Figure HI 

As before, given the random variable at the source X$ = Aj 1 which is distributed as 
p(Xs) = Y[p(Xs), the fixed relay mapping and the channel induces the joint distribution on 
(Xg,Yy). Let T^ 2 (Y V ) denote the set of all typical received block vectors at node v. 

Previously, we created the codebook by binning the typical sets T^ 2 {Y Di ). But now, we 
prune this set before binning. This will, in effect, lead to pruning the set of typical vectors 
T^ 2 (Y V ) at all nodes. To do this pruning, we fix a constant k, which will be defined in 
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Section 13.1.31 At every node v, we pick a random 2 TlT2K fraction of T^ 2 (Y V ) and call this 
subset & v . We then define, 

X d = f : V., 30? G 6,, s.t. ^ e 7f 2 (Fv)}. (24) 

We will prune our codebook so that only codewords in 3, are typically received. 

Following the Marton code for the deterministic channel, we bin the set of all 3 a > the 
typically received vectors at the destination, into 2 TlT2Ri ^ bins, for i = 1, . . . , J. For a given 
message pair Wi,...,Wj, we select a jointly typical y Dl ,...,y Dj G 7^ T2 (3z)i, 3dj) with y Di 
belonging to the bin corresponding to the message Wi, and transmit the x T2 which is jointly 
typical with all the yo v The decoder on receiving a.y^., finds the index of the bin into which 
it falls and reports this as the message Wi. This is our pruned codebook for the level-2 block 
t 3 . Similarly pruned codebooks are generated for all the T 3 level-2 blocks. 

Performance Analysis: 

As before, we first find the rate constraints for the t3-th level-2 block, so that arbitrarily 
small probability of error can be achieved for decoding the message at this block. For this 
purpose, we first need to establish a lower bound on the size of 3/v Towards this, we have 
the following lemma. 

Lemma 3. 

|3,| > 2 T ^ H ^n=fv R )- NT ^) w .h.p. as T 2 oo, (25) 

where N — |V| — 1. 

We use the notation a n > 2 nb , to denote the existence of a nonnegative sequence e n — > 
such that a n > 2 n ^ b ~ en \ Notations < and = are used in a similar sense in the rest of the paper. 
For a sequence of events (£(n) indexed by n, we use "(£(n) w.h.p. as n — > oo" to denote that 
P{(£(n)} — > 1, as n — > oo. In this section T 2 will play the role of n. 

Proof. The details of the proof are in Appendix El To prove this lemma, we use the random- 
ness in the choice of the pruned subsets & v . □ 

The following lemma then characterizes the achievable rates, Ri(ts), by our pruned scheme 
in a ^3-th level-2 block. 

Lemma 4. As long as Riit^) satisfies, Wi = 1, . . . , J, WJ C {1, . . . , J}, 

Rj(h) < ±H(Y Dj \F Vn = f Vn (t 3 )) - \J\Nk, (26) 
J i 

the probability of error can be made arbitrarily small by choosing a large enough T 2 . 

Proof. The result follows from the fact that, our pruned version of the Marton scheme bins 
3 instead of T^ 2 (Y D .). The details of the proof are in Appendix O □ 
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As before, we can now average over the rates in each level-2 block, to get the overall rate 
Ri. Under the pruned scheme, each rate is reduced by Nk. Thereby, we have the following 
lemma. 

Lemma 5. A rate tuple R = (R±, . . . , Rj) is achievable for the DSN, using the pruned scheme 
V(k), if\/ J , there is a product distribution Y[ p{X v ) (denoted by Q p ) such that 

Rj < Cj(Q p ) - \J\Nk. (27) 

3.1.3 Code for the Gaussian Network 

We will now show that the pruned scheme for the DSN can be lifted to a scheme for the 
Gaussian network to establish the following result. 

Lemma 6. If k = log(12A r — 2) + ll, then any rate tuple (R±, i?2, Rj) that can be achieved in 
the DSN using the pruned coding scheme V(k) can also be achieved in the Gaussian network. 

Proof. Note that the pruned coding scheme V(k) for the DSN operated over three levels of 
nested blocks. This scheme is now adapted to the Gaussian network as follows. We will use 

— # — * 

K. v i¥-v to denote the level-2 blocks in the Gaussian network (corresponding to X V ,Y V in the 
DSN). 

1. The coding over level-2 blocks is performed in the same way as in the DSN. We will 
describe the operation during the t3-th level-2 block. In this level-2 block the relay 
mapping is fixed to fv R fe): we wm suppress this conditioning henceforth for notational 
convenience. 

2. Given a message Wife), the source transmits the same vector that it would have trans- 
mitted in the DSN, i.e., g$(Wife), Wjfe)) = 5^(Wi(t 3 ), Wjfe)). 

3. Node v receives y T2 and decodes y^ 2 , the corresponding vector that it would have received 
in the DSN. 

Lemma 7. If k = log(12A r — 2) + ll, then the probability of error for decoding the received 
vector y^ 2 in the DSN from the corresponding vector y T2 in the Gaussian network goes 
to zero as T 2 -> oo. 

Proof. There is a natural joint distribution on (Xs,Y v ,Y_ v ) given by 

p(X s ,Y v ,Y v ) = p(X s ) P (Y v \X s )p(Y v \X s ) (28) 
= p{X s )p{Y l \X s )p{Y v \X s ). (29) 

— * —* 

Here p(Y v \Xs) and y(Y v \X s ) are a function of the relay mappings. The relay node in 
the Gaussian network finds the y^ 2 G & v such that0 

(£\yI 2 )eT^(Y v ,Y v ). (30) 

1 While the relay also knows that the received vector should be contained in 3u ^ 6», we do not explicitly 
use this information in the decoding step. 
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& v is a random 2 TlT2K fraction of the typical set T^ 2 {Y V ). Therefore 

\& v \ = T T ^1 T ^ % ). (31) 

If we choose n = log(12JV— 2) + ll, then Tin > H(Y V \Y_ V ) (see [3] for a similar argument). 
Therefore, 

|e,| < 2- T * H &'^2 T > H ^ (32) 
^\& v \ < 2 T ^^\ (33) 

This condition ensures that the typical set decoding succeeds with high probability, and 
thus we can decode Yj 2 from Yj^ 2 . □ 

4. Any node v now performs the same mapping as in the DSN, 

= /„(»?), (34) 

and transmits the vector xj 2 = x^ 2 . 

5. The destination D{ reconstructs the corresponding vector from y T2 , and since de- 
coding is possible in the DSN, it is possible in the Gaussian network as well. 

To conclude: this procedure yields a code for the Gaussian network that can achieve the 
same reliable rate of communication as the pruned code in the DSN. □ 

3.1.4 A Simplified Coding Scheme 

Our coding scheme for the deterministic and the Gaussian networks comprised coding over 
three levels of blocks. We will show now that, if we consider the maximization of a linear 
functional of the rate, the third level is not necessary (i.e., T 3 = 1 is sufficient) in an operational 
scheme, and is used only as a random coding technique. Suppose we want to maximize a 
certain linear functional of the rate, 

max j XiRi- (35) 
flec(Q P ) ^ 



In Section 12.21 we saw that the average of the rate across various level- 2 blocks yields the rate 

Ri = T 3 T,Js=i R ^)- Thus 

i n 

i 3 t 3 =l i 

which implies that there exists a t% such that ^jAji?i(ts) > ^Aj-Rj. If we use the corre- 
sponding relay encoding functions fvnih) throughout, then we achieve the best possible linear 
functional of the rate under this scheme. Thus coding over T3 level-2 blocks is unnecessary for 
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maximizing a linear functional in the rate region. Thus coding over multiple relay transforma- 
tions is only a proof technique and not a method of operation of the network for maximizing 
a linear functional of the rate. 

By the convexity of the rate region of the scheme, all extreme points in the rate region can 
be achieved by setting T3 = 1. To achieve any point inside the rate region, time sharing of 
these schemes will be required in general. Since the relay encoding functions /v H (^3) required 
to achieve this may be different for different extreme points, time sharing can be thought of 
as being equivalent to coding with a larger T 3 . 

3.2 Non-layered Networks 

Consider a general network specified by the set of vertices V. In [T8| [TJ, it has been shown in 
the context of unicast traffic that any network can be unfolded in time to get a layered network. 
In the broadcast scenario here, we use the same procedure. The layering strategy is briefly 
described below. Similar to pQ, the level-1 (inner) block is now over KT\ time symbols. The 
relay node still does random mappings over blocks of T\ symbols, however the transmit vector 
in fc-th block, for k — 1, . . . , K, now depends on the last k — 1 received blocks. This relaying 
scheme can then be represented as a layered-network in time. The induced layered network 
then has K layers, and each layer has |V| + J + 1 nodes - the |V| nodes v[k], k = 1,2, |V| 
corresponding to the network at time slot k and J + 1 special nodes T[k], Ri[k], Rj[k] that 
act as virtual transmitters and receivers that act as transmit and receive buffers, holding 
all the information about source message in the case of transmit buffer and holding all the 
received information in the case of received buffer. The set of edges connecting the adjacent 
layers is derived from the network by the following procedure: 

1. Vi[k] is connected to v 2 [k + 1] with a link h Vl)V2 which is the channel corresponding to 
the link from V\ to v 2 in the original network. 

2. Memory inside a node is maintained by connecting Vi[k] to Vi[k+ 1] using an orthogonal 
and infinite capacity link. 

3. The transmit buffer is maintained by connecting T[k] to T[k + 1] and also T[k] to S[k + 1] 
using orthogonal and infinite capacity links. 

4. Receive buffer for the the destination node j is maintained by connecting Rj [k] to Rj [k + 
1] and from Dj[k] to Dj[k + 1] using orthogonal and infinite capacity links. 

5. A 0-th layer with 5* = T[0] alone, and a K + 1-th layer with Dj = R[k + 1], j = 1, J 
are added to serve as the source and destinations in this unfolded network. 

Let us consider the cut-set bound (normalized by K) between the source and the destina- 
tion nodes J for this unfolded network Cj~ nn (Q) and for the original network by Cj g (Q). 
Then, we have the following lemma (See proof of Lemma 5.1 in [1]). 

Lemma 8. C^ unf (Q) = ^C°J 9 {Q). 
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If we take K large enough, the effect of the penalty term ~^ can be made as small as 
desired. Lemma El combined with our result for layered networks, yields the desired result for 
general (non-layered) networks. 

4 Discussion 

In this section, we first review linear deterministic broadcast networks, and the capacity- 
achieving schemes for such networks. We then show how these schemes are reciprocal to 
the schemes for the multi-source single-destination deterministic networks. Then we pro- 
ceed to identify the intuition connecting the reciprocal schemes based on a contrast between 
transmitter- centric and receiver- centric viewpoints, and use this intuition to point out the in- 
spiration for the Gaussian broadcast network scheme. At the end of this section, we point out 
the channel state information required at various nodes in order to implement these schemes. 

4.1 Linear Deterministic Networks (LDN) 

A deterministic network of particular interest is the linear finite-field broadcast deterministic 
network pp. The inputs and outputs are vectors over a finite field, i.e. Xj = y,j = F£, for some 
prime p and q G N. The channels are linear transformations over this finite field i.e., 

yA m ] = G i,i x i[ m ^ ( 37 ) 

where G^j G ¥^ xq . In particular, Gij are often assumed to be "shift" matrices. The linear 
deterministic model captures wireless signal interaction like interference and broadcast and 
on the other hand has an algebraic structure that can be exploited for understanding schemes 
in this network. 

Corollary 2. For the linear deterministic broadcast network the capacity region is in fact the 
cut-set bound. 

Proof. We can show that for the linear deterministic network the cut-set bound, Cj(Q), is 
maximized by uniform and independent distribution of G V}. Therefore 

Cj(Q) = Cj(Q p ) = min rank(G n , n =), (38) 

where Gn,n c is the matrix relating the vector of all the inputs at the nodes in Q to the vector 
of all the outputs in Q c induced by f l37j) . The inner bound has already been shown for the 
general deterministic network in Theorem [2j This proves the corollary. □ 

An important question is whether simpler linear schemes are optimal for these networks. 
It has already been shown in [1] that for the single-source single-destination relay network, 
linear mappings at all nodes suffice. The intuition behind the proof is that, when the relay 
nodes randomly pick transformation matrices, the resulting matrix between the source and 
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the destination has rank equal to the min-cut rank of the network, with high probability. 
Therefore, if the rate is lesser than the min-cut rank, random linear coding at all nodes 
(including the source but not the destination) ensures an end-to-end full-rank matrix and the 
destination, knowing all these encoding matrices, picks up a decoding matrix, which is the 
inverse of the end-to-end matrix. This intuition is then used to obtain schemes in the general 
deterministic relay network and the Gaussian relay network in p], where the relays perform 
random mapping operations resulting in an induced end-to-end channel between the source 
and the destination. Then the source uses a random code to map the messages, and the 
destination performs a typical set decoding. It has also been shown in [13] and [H], that for 
the linear deterministic relay network, restricting the relay mappings to permutation matrices 
is without loss of optimality. The next corollary claims a similar result even for the linear 
deterministic broadcast network. 

Corollary 3. For linear deterministic broadcast network, linear coding at every node is suf- 
ficient to achieve capacity. Furthermore, the mapping at relay nodes can be restricted to 
permutation matrices. 

Although this can be proved directly, we will use the connection between linear coding and 
reciprocity to prove this in the next section. 

4.2 Reciprocity 

The reciprocal of a Gaussian communication network (with unit power constraint at all nodes) 
with multiple unicast flows can be defined as the network where the roles of the sources and 
the destinations are swapped. Note that any channel coefficient that captures the signal 
attenuation between a pair of nodes is the same in either direction. For a linear deterministic 
network, the reciprocal network was defined in [15] as the network where the roles of the 
sources and the destinations are swapped, and the channel matrices are chosen as transposes 
of each other in the forward channel for the network and its reciprocal. 

While it is unresolved whether a given network and its reciprocal have the same capacity 
region, many interesting examples are known for which this is true. For some cases, this 
reciprocity is applicable even at the scheme level. 

• Wire-line networks can be considered as a special case of wireless networks studied here. 
It has been shown in [19] that wire-line networks are reciprocal (also called reversible in 
the literature) under linear coding. 

• In [IB] , it was shown that reciprocity, under linear coding, can be extended naturally 
to the linear deterministic network. The reciprocity was shown at the scheme level and 
the coding matrices at each node can be obtained from the reciprocal network. 

• In Gaussian networks, duality has been shown, [20j El], between the multiple access 
channel (MAC) and broadcast channel (BC), where it was shown that the capacity 
region of the MAC is equal to the capacity region of the BC under the same sum power 
constraint. This duality was also shown, interestingly, at the scheme level between the 
dirty-paper pre-coding for the BC and the successive cancellation for the MAC. 
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The reciprocal network corresponding to the broadcast network studied here is the network 
with many sources and one destination. 

4.2.1 Sufficiency of Linear Coding in LDN 

The multi-source single-destination network has been studied in [6j [161 EO 122] , the capacity 
region for the linear deterministic network with many sources and one destination is estab- 
lished and it is further shown that linear coding is sufficient to achieve this. This is done by 
converting the problem to the case of single-source single-destination by adding a super-node 
and connecting all the source nodes to the super-node by orthogonal links with capacities 
equal to the rate required for that source. Since random linear coding at the source and the 
relays works for the single-source single-destination network, it works for this network too. 
Therefore, the source nodes and the relay nodes perform random mappings, and the desti- 
nation, knowing the source and relay mappings, can then carefully pick the decoding matrix 
that inverts this overall matrix. Since this coding is linear, we can use the reciprocity result of 
|15j . to show that any rate achievable in the dual multiple-source single-destination network 
is also achievable in the single-source multiple-destination case. Along with the fact that the 
cuts are reciprocal in these two networks, this implies that linear coding is optimal even in 
the case of the linear deterministic broadcast network. This result has also been shown in [TT] 
without using reciprocity by adopting an algebraic approach. 

Furthermore, from the results of [13] and [H], it can also be shown that the sources and 
the relays can pick up specific permutation matrices for the single-source single- destination 
network. The above argument can then the extended to show that a coding scheme involving 
only permutation mappings at the relays is sufficient for the linear deterministic broadcast 
network. 

4.3 Receiver- Centric Vs. Transmitter- Centric Schemes: Intuition 
for the Gaussian Broadcast Network Scheme 

We now continue on our discussion on duality for linear deterministic network to illustrate 
how these ideas lead us to a scheme for the Gaussian broadcast network. We begin by defining 
two viewpoints in which schemes can be constructed. A transmitter- centric scheme in one in 
which the scheme is constructed from the viewpoint of the transmitter, where the codebook at 
the transmitter is first selected using a random coding argument and then the receiver chooses 
its de-codebook in accordance with the realization of the transmit codebook. In contrast, in 
a receiver- centric scheme, we fix the decode-book, which comprises of the mappings from the 
received vectors to the messages, and based on these mappings, the transmitter chooses its 
codebook to ensure low probability of error. 

Because random coding is done at the source, we can think of this scheme as first con- 
structing the transmitter codebook in a random manner and the receiver then constructs its 
de-codebook as a function of the realization of the transmit codebook. While the scheme 
for the linear deterministic multi-source network is transmitter-centric, the scheme for the 
linear-deterministic broadcast network is receiver- centric. 
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4.3.1 A Point-to-Point Channel 



For a point-to-point channel, the usual random coding scheme |4] can be regarded as either 
a transmitter-centric scheme, which is the traditional viewpoint (since the random codebook 
is thought of as being constructed at the source), or as a receive-centric scheme. It can be 
viewed as a receiver-centric scheme, because, at the receiver we construct a vector quantization 
codebook (alternately viewed as the decode-book) or rate R, which "quantizes" the received 
signal y T to a vector x T (m), for some m where m is the message index and x T (m) is the 
m-th quantization codeword. Now the source sets its codebook to be equal to the vector 
quantization codebook at the destination. This scheme is the same scheme as the usual random 
coding scheme. The distinction between transmitter-centric and receiver-centric schemes in 
this example is therefore one of personal preference, rather than an enforced one. 

4.3.2 Multiple-Access Vs. Broadcast Channel 

In some networks, we may not have the luxury to use the two viewpoints simultaneously, in 
which case we need to choose between the two. In the capacity-achieving coding scheme for 
the multiple access channel |3], the random coding is done at the transmitters and the receiver 
does joint typical-set decoding, based on the specific codebooks constructed at the sources. 
This provides a good example of a transmitter-centric scheme. 

In contrast, for the two-user broadcast channel, we can now view the Marton coding 
scheme (0 E]), used in Sec. \2.2.2\ as a receiver-centric scheme. In this scheme, there are 
two auxiliary random variables, U\ and U2, which we view as corresponding to the vector 
quantization variables at the two users. The receiver i can be thought of as constructing a 
vector quantization codebook which "quantizes" the received vector to Uf(wi), where w^ 
is an index belonging to a set larger than the set of all messages to user i, and bins the set of 
all Wi to the message m 8 for user i. The transmitter, to transmit a message pair (mi, 7712), finds 
a pair (wi, W2) such that Ui(wi), U 2 T (w2) is jointly typical. From this viewpoint, the receivers 
are choosing random de-codebooks, and the transmitters are choosing specific codebooks to 
be a function of the realization of the de-codebook. Thus the coding scheme can be viewed 
as a receiver-centric one. 

4.3.3 Multiple- Access Vs. Broadcast in Linear-Deterministic Networks 

From [T6|l6]. we know that a transmitter-centric scheme, where the sources and the intermedi- 
ate nodes perform random coding, is optimal for the many-source single-destination problem 
in the linear determinstic setup. Intuition suggests that a natural receiver-centric method 
should work for the reciprocal network (i.e., single-source multiple-destination network). In 
particular, the relays perform random mappings, and the destinations perform "random de- 
coding", i.e., they fix a random linear mapping from the received vector into a smaller message 
vector. Once these mappings at the relay and the destinations are fixed, the source evaluates 
the induced linear broadcast channel between the source and the various destinations; and 
constructs a linear broadcast code for this channel. This scheme can then be shown to be 
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Figure 5: Reciprocity in linear deterministic networks 



optimal for the broadcast network, because this is the reciprocal of the linear random coding 
scheme, which is optimal for the multi-source single-destination network, as shown in Fig. |5j 

4.3.4 Scheme for Gaussian Broadcast Networks: Lifting Scheme as a Receiver- 
Centric Scheme 

The general idea for the scheme for the linear deterministic broadcast network is the foundation 
of our scheme for the Gaussian broadcast network in Sec. [3J In order to build the scheme 
for the Gaussian network, we first construct a scheme for general deterministic networks (of 
which the DSN is a special case) and then lift the scheme from the DSN to the Gaussian 
network. In case of the linear-deterministic broadcast network, the source-mapping depended 
on the specific relay transformations used, not just on the probability distribution used to 
create the relay transformation. Extending this idea, we would like to construct a scheme 
for the general deterministic network, where the source codebook is a function of the specific 
relay transformation. Indeed, we resolve this problem by constructing a Marton scheme at 
the source for the vector broadcast channel induced by the specific relay mappings. 

Next, the scheme for lifting codes from the DSN to Gaussian relay networks proposed in 
[3] requires each node, including the destination, to prune their received vectors to a restricted 
set to ensure that the received vector in the DSN can be decoded from the received vector in 
the Gaussian network. Since this scheme restricts the received codewords at the destination, 
this scheme also naturally fits into a receiver-centric viewpoint. 

While the lifting procedure proposed there works only for single-source single- destination 
networks, we extend the procedure to our specific scheme for broadcast network. We achieve 
this by designing a pruned Marton code, in which the receivers are guaranteed to receive 
vectors which are in the pruned set. Instead of binning the set of all possible received vectors 
into messages, as we would for a broadcast channel, we now bin only the pruned received 
vectors to construct the pruned Marton coding scheme. The natural alignment of the receiver- 
centric viewpoints of the Marton scheme and the lifting scheme allows us to construct the 
scheme for the Gaussian broadcast network. 
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4.4 Approximate Reciprocity in Gaussian Mult i- Source and Broad- 
cast Networks 

In this section, we will demonstrate that there is an approximate reciprocity in the capac- 
ity regions of a Gaussian multi-source network and the corresponding reciprocal Gaussian 
broadcast network. 

In our model, we have assumed, without loss of generality, the average transmit power 
constraint of unity at each node. We have also assumed that the reciprocal network, in 
addition to having the same channel coefficients, also has unit power constraints at each node. 
However, it is not clear if this is the "right" way of defining the corresponding reciprocal 
network. For instance, in [2Ql [2T] , MAC-BC duality was shown under the assumption of same 
total transmit power in both networks; however this power could be divided amongst the 
nodes in a different manner in the forward and reciprocal networks. Under this assumption, 
it was shown that the capacity region of the two networks was identical. However, since we 
are concerned only about approximate reciprocity in this section, which is a weaker form 
of reciprocity, our definition of unit power constraint everywhere will be sufficient to show 
approximate reciprocity. 

In [16] and [6], a coding scheme is given for the Gaussian network with many sources and 
is shown to achieve the cut-set bound region within a constant gap, which depends only on 
the network gain. In Sec. [31 we have showed that for the Gaussian broadcast network also, 
we can achieve the cut-set bound region within a constant gap. As a result, to show that 
the capacity region of the two networks are themselves within a constant gap, which depends 
only on the network topology and not on the channel gains, all we need to do is to observe 
that cut-sets of the reciprocal networks are within a constant gap of each other. Note that 
the cut-set bounds corresponds to MIMO point-to-point channel where all the nodes on the 
source side of the nodes can be thought of as transmit antennas and all the nodes on the 
destination side can be thought of as receive antennas. The relationship then between a cut in 
a network and the corresponding cut in the reciprocal network is the same as the relationship 
between a MIMO channel with channel matrix H and the reciprocal MIMO channel with the 
channel matrix H T . The reciprocity of MIMO channel has been shown in [23], under equal 
total transmit power, i.e. the capacity of the two networks is the same. It can be further shown 
that restricting to per node power constraint only leads to a loss which does not depend on 
the channel gains. Therefore, we can show that the cut-set bounds are reciprocal. 

4.5 Induced Coordination in Relays' Transmission 

Let us consider a simple example for a broadcast relay network comprised of a single source, 
two relays and two destinations, shown in Fig. O The link between the source to the two 
relays is infinite, which implies therefore that the network is essentially a MISO broadcast 
channel with two transmit antennas and two receivers, each with a single antenna. It is clear 
that for a MISO broadcast channel, independent coding across the two relays is insufficient to 
even obtain the best possible degrees-of-freedom. Therefore, any scheme that is approximately 
optimal needs to perform coordinated transmission at the relays. 
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Figure 6: MISO broadcast channel as a special case of broadcast network 



In the proposed scheme, the relays perform quantization followed by independent encoding 
of the quantized bits into transmitted vectors. At a first glance, a scheme in which the relays 
are performing independent mappings seems incapable of attaining good performance because 
of the inability to induce coordination. However two key features in the proposed scheme help 
avoid this pitfall. 

• The relays Ri and R2 perform quantize-and-encode relaying in the aforementioned ex- 
ample, in spite of the fact that they can decode the source message completely. Had 
the relays decoded the source message and performed independent encoding, there is no 
possibility of achieving the degrees-of-freedom of even this simple broadcast network. 

• The source takes into account the specific realizations of the relay mappings and con- 
structs the coding scheme. This ensures that from the point of view of the receiver, the 
signals transmitted by the two relays appear coordinated. In particular, in this example, 
since the channel from S to Ri is infinitely good, the relay Ri quantizes the received signal 
to a very fine degree and encodes this for transmission to the destination. This gives the 
source many degrees-of-freedom to encode information in the various least-significant- 
bits of its transmission, so that after the relay mappings, the relay transmissions appear 
coordinated. 

4.6 Channel State Information 

We now examine the channel state information required at the various nodes for the schemes 
proposed in Sec. [2] and Sec. [3] for deterministic and gaussian broadcast networks. 

4.6.1 Deterministic Broadcast Networks 

For deterministic networks, the following channel state information is required: 

1. All non-source non-destination nodes are unaware of any channel state information. 

2. We assume that each destination knows the distribution of the received vector Y^, and 
the transmitted rate -^(£3) for each i 3 . The destination bins the set of all typical vectors 
into 2 TlT ' 2Ri bins corresponding to the messages, and uses this as the decoding rule. 
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3. The transmitter is assumed to have full CSI, and knows the relay mappings at all nodes 
and also the binning scheme at the destinations. The transmitter construct the codebook 
using the same binning scheme as the receiver. 

Thus this scheme has the interesting property that if the transmitter had all knowledge, 
the intermediate nodes have zero knowledge and the destination has a little knowledge (about 
the distribution of the received vector) , then the same rate can be achieved as the complete 
CSI case. 

This is dual to the situation of the multi-source single-destination network, where the 
receiver having full knowledge, intermediate nodes having zero knowledge and the transmitters 
having a little knowledge (about the distribution of the transmitted vector) can achieve the 
same rate as full channel knowledge. 

4.6.2 Gaussian Broadcast Network 

For Gaussian networks, the following channel state information is required: 

1. All non-source non-destination nodes are unaware of any channel state information. 
Node v however knows the probability distribution of the received and transmitted vectors 
Vy v iPx v for the corresponding DSN, which will be used to calculate the relay mappings. 
The node v also needs to use the received vector distribution to pick a pruned subset of 
the typically received vectors in the corresponding DSN. 

2. We assume that each destination knows the distribution of the received vector in the 
corresponding DSN Yo i; and the transmitted rate Ri{t^) for each £3. The destination 
maps bins the set of all typical vectors into 2 TlT2Ri bins corresponding to the messages, 
and uses this as the decoding rule. 

3. The transmitter is assumed to have full CSI, and knows the mappings used at all the 
nodes and also the binning scheme at the destinations. The transmitter then uses the 
same binning scheme used at the receiver. 

This scheme has the interesting property that if the transmitter had all knowledge, the 
intermediate nodes and the destination have some knowledge, then the same rate can be 
achieved as the complete CSI case. 

5 Generalizations 

In this section, we present various generalizations of our result, for half-duplex networks in 
Section 15.11 for networks with multiple antenna in Section 15.21 and for broadcast wireless 
networks, where some set of nodes demand the same information and other nodes demand 
independent information in Section 15.31 
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5.1 Half Duplex Networks 

Our discussion so far has been restricted to the context of full duplex scenario. A network is 
said to be half duplex if the nodes in the network can either transmit or receive information, 
but not do both simultaneously. Therefore the network needs to be scheduled by specifying 
which nodes are listening and which nodes are transmitting at any given time instant. Let 
the set of all possible half-duplex schedules at any time instant be %. An edge is said to 
be active at time slot k if Vi is transmitting and Vj is receiving at that time slot. 

Consider K time slots and at any time instant k, let hf. G % be the half duplex schedule 
used, and h K be the sequence h\, h,2, Hk- We consider only static schedules here, that is, 
schedules that are specified apriori and do not vary depending on dynamic parameters like 
channel noise. For any static schedule h , we can unfold the network graph with respect 
to that schedule. This procedure is performed in p], and is the same as the procedure in 
Section I3T21 except for the following difference: v\[k] is connected to ^[fc + 1] with a link h VltV2 
only when e VlV2 is active at time slot k. 

Given that the network is operated under a schedule h K , we define the set of all rate pairs 
achievable as the capacity region under the schedule h K . An upper bound on the capacity 
region under the schedule h K is given by the cut-set bound in the unfolded layered network 
corresponding to the schedule. This rate can be achieved within a constant gap by using 
Theorem [TJ Thus for any schedule h K , any rate tuple within the constant k = 0(| V| log ( | V | ) ) 
of the cut-set bound can be achieved (to within a constant number of bits) using that schedule 
and then using the scheme of Theorem [1] for the unfolded layered network. Now, we can 
optimize over all schedules h G % allowed under the half-duplex constraints. Thus, the 
capacity region of the network under static half-duplex scheduling is the union over all possible 
schedules of the capacity region under schedule h K . Therefore, any rate tuple (Ri, Rj) such 
that (Ri + k, R2 + k, Rj + k) is in the capacity region of the network under static half-duplex 
scheduling can be achieved by using the method described here. 

5.2 MIMO 

In this section, we consider the implication of having multiple antenna elements at each of 
the nodes in the network. Suppose v possesses m v antenna elements, which are used for both 
transmission and reception. The basic result for multi-antenna broadcast networks is the 
following. 

Theorem 3. For the multi-antenna broadcast network, a rate vector (Ri, . . . ,Rj) is achiev- 
able, 

{R 1 + k,...,R J + k) EC (39) 

for some constant k, which depends only on the number of nodes, and not on the channel 
coefficients, and k = 0(M log M) where M = J2v£V m v ■ 

Proof. The proof is essentially the same as the one for the single antenna case in Section [3J 
We only outline the proof below, highlighting the key distinctions. 
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1. For a given multi-antenna Gaussian network, we first obtain a multi-antenna DSN such 
that the cut-set bound for the Gaussian network and the cut-set bound under the product 
form distribution for the DSN differ only by a constant (in the same manner as we 
obtained Lemma [2] in Sec. 13.1.11 This is because (1721) continues to hold for multi-antenna 
networks, with |V| now replaced by the total number of antennas M. 

2. Then we use Theorem [2]to show that the cut-set bound under product form distribution 
is achievable for the DSN. 

3. We can now prune the codebook for the DSN by a factor k, to get a pruned scheme for 
the DSN, V K . The pruned scheme achieves a rate Mk lesser than the original rate. 

4. Under the pruned scheme, the received vector in the DSN can be decoded from the 
received vector in the Gaussian network. 

5. Therefore the DSN coding scheme can be emulated in the Gaussian network, and achieves 
a rate a constant k = 0(M log M) lesser than the cut-set bound in the Gaussian network. 

□ 



5.3 Broadcast-cum-Multicast 

The broadcast network comprised of a single source S and destinations Di, D 2 , Dj de- 
manding independent messages at rates R±, R 2 , Rj. Suppose that in addition there are 
also other multicast destinations M%, M 2 , Ml that demand all the messages transmitted by 
the source. We call such a network a broadcast-cum-multicast network. In this section, we 
will show that even for such networks, the cut-set bound is achievable to within a constant 
number of bits. This network is a generalization of both the multicast network considered in 
P and the broadcast network considered in the previous sections. 

First we note that the cut-set bound for the broadcast-cum-multicast network is given by 
the cut-set bound for the broadcast network, along with the cut-set constraints for each multi- 
cast receiver. In particular for the Gaussian broadcast-cum-multicast network, if (Ri, ...,Rj) 
is achievable, then V J C [ J] there exists a joint distribution Q such that, 

Rj< mm I(X Q ;Y QC \X nc ), (40) 

and in addition, the sum rate is constrained by all the multicast destinations since all these 
destinations demand all the messages transmitted by the source 

Ru] < min min I(X Q ;Y n c\X n c). (41) 

1 1 ie[L] fieA M . 

The set of all rate tuples inside the cut-set bound is then denoted by C. 

The main result for the wireless broadcast-cum-multicast network is that any rate a con- 
stant away from cut-set bound is achievable. 
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Theorem 4. For the Gaussian broadcast- cum-multicast network, there exists a constant k, 
which does not depend on the channel coefficients and is 0(\V\ log |V|), such that a rate vector 
of (R\, Rl) is achievable whenever 

(R 1 + k,...,R J + k)eC. (42) 

To prove this result, we follow an approach similar to the one we took for broadcast 
networks. First we will prove a result for deterministic broadcast-cum-multicast networks. 
Second, we show that the Gaussian network can emulate the deterministic superposition 
network with a constant rate loss. These two steps are completed in the rest of this section. 



5.3.1 Deterministic Broadcast-cum-Multicast Network 

The next Lemma shows that for the deterministic broadcast-cum-multicast network, the cut- 
set bound evaluated under product form distributions is achievable. 

Lemma 9. For the deterministic broadcast-cum-multicast network, the cut-set bound under 
product-form distributions is achievable, i.e., a rate vector (Ri, Rk) is achievable if for 
every J G [J] there is some product probability distribution Q p , such that, 

Rj < min I (X Q ;Y QC \X QC ) (43) 

HGA j 

= mm H {Y nc \X nc ) , and (44) 
R[j] < mm min I (X n ; Yqc\Xqc) (45) 

ie[L] Cl£A M . 

= min min H (Yqc\Xqc) . (46) 

Proof. Coding Scheme: The source operation, relaying operations and decoding operations at 
the broadcast destinations remain the same as in Section 12.21 In addition, we need to specify 
the decoding operation at the multicast destinations. 

At each level-3 block, each multicast destination performs a typical set decoding with 
the set of all possible Xj 2 (corresponding to all possible messages) and finds the unique 

WifeO,-.., Wj(t 3 ) for which the {X^{W 1 {t z ),...,W J {h)),Y^ e T^(X s ,Y Mi ). 

Performance Analysis: If the cut-set bound for the broadcast destinations is satisfied, then 
the probability of error at these destinations is guaranteed to be small (by the same analysis 
in Sec l2.3p . We need to analyze the error events at all multicast destinations. The probability 
of error at destination Mi in the t^-th level-2 block goes to zero, with T2 — > 00 if 

R[J](tz) < ^I(X s ;Y Mi \F r = f r (t 3 )) (47) 
J 1 

= ±H(Y Mi \F r = f r (t 3 )). (48) 
J 1 

As before, since the overall rate is given by averaging the rate across all T 3 blocks, as 
T3 — > 00, the probability of error at destination Mj is small if, 

R [J} < ^H(Y Mi \F r ). (49) 
J 1 
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Using Lemma [U we can relate the entropy term above to the cut-set w.r.t. this destination, 
i.e for any arbitrary e > 0, 37\, s.t., we have 

1 -H(Y Mi \F r )> mm H (Y n c\X n c) - e. (50) 



□ 



Corollary 4. For the linear deterministic broadcast- cum-multicast network, the cut-set bound 
is achieved. For the deterministic broadcast- cum-multicast channel (a deterministic broadcast- 
cum-multicast network in the absence of relays and destination cooperation) the cut-set bound 
is achieved. 

Proof. In the former case, the cut-set bound under product form distribution is the same as 
the cut-set bound under general distributions since there is only one transmitting node in the 
network. The latter case can be proved by showing the cut-set bound for linear deterministic 
networks is optimized by product form distributions. □ 

5.3.2 Gaussian Broadcast-cum-Multicast Network 

We will now prune and lift the coding scheme from the DSN broadcast-cum-multicast network 
to the Gaussian broadcast-cum-multicast network. Since the encoding at the source and relay 
mappings are the same as a network with the broadcast destinations alone, the same procedure 
used for pruning and lifting the broadcast code in Sec. 13.1.21 and Sec. 13.1.31 can be used to 
lift the code for the DSN broadcast-cum-multicast network to the Gaussian broadcast-cum- 
multicast network. This procedure ensures that all nodes in the Gaussian network can decode 
the corresponding received vector in the DSN. Therefore, the destinations carry out the same 
decoding operation that they perform in the DSN. The performance analysis of the scheme is 
similar to the performance analysis for the broadcast network in Sec. El and it can be shown 
that any rate tuple .., Rj) that satisfies (Ri + k, Rj + k) G C can be achieved, where 
fc = 0(|V|log(|V|)). 

A Proof of Lemma H 

Proof. Fix any J C {1, . . . , J}. To prove the lemma, we consider a communication scenario 
where the source needs to send a message to a single destination which has access to Y Dj with 
rate R(j)- Define, 

W {J) ~ Uniform j[2^>]j, (51) 

and the mapping at the source node, 

■ W(j) — > Xg, which is generated using i.i.d. p(Xg). (52) 

Note that F(j) denotes a random source code-book and Fy R denotes random relay mappings. 
The probability of error conditioned on a given source code-book and relay mapping is given 
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by, 



{£ \F {j)) F Vr ) =f{y£ j (W j ) = Y^(w')\w' ? Wj,F (Jh F VR } 



(53) 



and the average probability of error, averaged across all code-books and random-relay map- 
pings, is given by, 

V e ^E[F{£\F (J) ,F VR }}. (54) 
By the coding theorem in p], we have 

V e ->• 0, as T oo, V R {J) < Cj{Q p ). (55) 

Now, 



%);^l%),fv fi ; 





H(W {J) ) 






TR(j) - 


E 


Fano 






> 


TR(J) - 


E 




TR(J) - 


(1 



H(Wj) - H(W {J) \Y< F (J) = f {J) , F Vr = f VR ) 



l+¥{£\F { j h F VR }R {J) T 



Letting R(j) = Cj(Q p ) — e\ and for large enough T, we have 



Further 



HW^YEJF^Fvx) > T{Cj{Q p )-e). 



I(W {J y,Y^\F {J) ,F VR ) = H(Y^\F {Jh F VR ) - H(Y^\F (j) ,F Vr ,Wj 

< H(YEjF VR )-H(Y£jF iJh F VR ,Wj,XT) 
= I{X t s] YE\F Vr ). 



Therefore, 

i(xI,ySJf Vr )>t(Cj(q p )-€). 

Since the channel is deterministic 

HiYSjF^) = /(Xj;F^|F v J 
> T(Cj(Q p )-e). 

The proof is completed by choosing 7\ to be the maximum T over all J . 



(56) 



(57) 



□ 



B Proof of Lemma [3 



Proof. Throughout this section, we will assume that the relay mappings are fixed to fv^?) 
without making this explicit in the conditioning expressions. We will assume N = 2 to begin 
with and establish a lower bound on |3i| (w.l.o.g.). 



3i = fef Genl^e.^^e^}. 



*T 2 -T 2 > 



-TV 
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We will consider two cases. 
Case 1; H{Y 2 \Y X ) > T x k 

Fix a yf G ©i. The set of sequences y 2 2 which are jointly typical with y[ 2 is given by 
T^ 2 (Y 2 \y[ 2 ) , and its size is of the order 2 T2H ^ Yl \ Now ©2 is a random 2~ T ' 2TlK fraction of the 
typical set T^ 2 (Y 2 ). Therefore we can show that , if H(Y 2 \Yi) > T±k, then w.h.p. as T 2 — > 00, 

©in7f 2 (f 2 |^)^0, (58) 

i.e. every y^ 2 G ©1 will be jointly typical with some y^ 2 G ©2- Therefore the size of the set is 
13^ = 16x1= 2 T2 ^^)- TlK ) w.h.p. as T 2 -> 00. Clearly 

|3x| >2 T2 ^~ 2T ^. (59) 

Case 2: H{Y 2 \Y X ) < T x k 

Fix a y{ 2 G ©1. For T 2 large enough, the probability that there exists a sequence y 2 2 G ©2 
which is conditionally typical given any y^ 2 G ©1 is given by p = 2 T2 ^ H ^ Y2 ^ Yl " , ~ TlK \ Consider 
an arbitrary subset ©n C © x of size 2 T2A . The probability that there is an element in © n 
which is jointly typical with an element in © 2 is given by: 

¥{3yf e& u :(y x \y?) e T?*{Y X ,Y 2 ) for some f 2 2 G 6 2 } 

= l-(l-p) 2T2A (60) 
> i-e~P 2T2A (61) 

= 1 - e -2^W- 2 !-i)--i« + A) (g2) 

-> 1, i£A>T 1 K-H(? 2 \? 1 ), (63) 

as T 2 00. 

So we will set A = T x k — H(Y 2 \Y X ) + e x . Thus if we divide ©1 into disjoint sets ©u each 
of size 2 T2A , then each will have at least one element in 3i- Therefore 

I3il > ^ (64) 

= 2 t 2 a (65) 
_ 2 r 2(^(^i)-r lK -Ti K +H(y 2 |n)) 

| 3i | >2 T 2 (H(y 1 )-2T ltt )_ (67) 



Therefore 



This completes the proof for the case when N = 2. By iterating this calculation, we can show 
that for a general N, w.h.p. as T 2 — > 00, 

|3i| > 2 T2 ^~ NT ^. (68) 

□ 



2n 



C Proof of Lemma 
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Proof. Let us consider the case where there are only two receivers i.e. J = 2. The proof 
extends similarly for the general case. 3 D t is the set of all typically received codewords at the 
destination A- We know from Lemma [3] that |3 D J > 2~ T2TlKN 2 T2H( - fD i\ Fv ^ =fv ^ t:i)) w.h.p. as 
T 2 —> oo. 

Since we are binning the set of all y^. G 3d, into 2 T2TlRi bins, to ensure each bin has at 
the least one codeword, we need for some e > 0, 

2 T2T 1 (R l (t 3 )+e) < |_ 

It is sufficient to have, 

2 T2T 1 (R 1 (t 3 )+e) < 2-T2T 1 KN 2 T2H(Y D JF VK =f VK (t 3 )) 

=► Rife) < ^H(Y Di \F Vn = f Vn (t 3 ))-NK. 
1 i 

Now, there is no error if corresponding to each message pair (Wife), W 2 fe)), the source 
can find a jointly typical (y^, Ud 2 ) * n ^ ne ^ n corresponding to (Wife), W 2 fe)). This can be 
done w.h.p. as T2 — > 00, as long as there are at the least 2 T2< - 7< - Yd i ;Yd 2' Fv tc = ^ tc (*3))+<:) p a j rs Q f 
(Vd-iiVd) in this bi n - This condition translates to 

|3Z3 3 1 > ? T 2 (I(Y Dl ;Y D2 \F V7z =f Vn (t a ))+e) 



This is satisfied if, 

2 T2{H{Y Dl \F Vn =f Vll (t A )))+H{Y D2 \F Vlz =f Vlz (t 3 ))} 



> 2 T 2( / (^l'^2l i? V TC =/v TC (t3))+e) 



2^2 7i (Ri (t 3 ) +-R2 (i 3 ) ) 2 2T 2 Ti Nk 

Rife) + R 2 fe) < ^H(Y Dl ,Y D2 \F Vn = f Vn fe))-2T 2 TiNK. 

J- 1 



□ 



D Proof of Lemma 2 



First of all, we note that the for the Gaussian network, the cut-set bound is given by letting 
the inputs be jointly Gaussian, i.e. Q = CM (0, K) with Kjj < 1 such that 

C Gauss = {(Ri,...,Rj) : Rj < C^ SS (Q) = mm\og\I + H n ^K n H* ntnc \, C [J]j($9) 
where Hq : qc is defined as the matrix such that 

Yn c = Hq^cXq + HqcqcXqc + Zqc, (70) 
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and Kq is the conditional covariance matrix of Xq given Xqc 

It is also well-known (see Lemma 6.6 in [I]) that restricting to the product distribution 
Q p = CJ\f(0,I), only leads to relaxation of the mutual information term in ( 1691) by at most 
min(|f2|, |fi c |), and therefore 



C^ auss (Q) < C^ ma {Q p ) + |V|/2 

= minlog|/ + i/^^l + |V|/2. (71) 

neAj 

The following proposition shows that every cut in the Gaussian network is within a a 
constant gap to the cut in the DSN. 

Proposition 1. For every Q G Aj, there exists a product distribution Q^ SN = Yivev Pi-^-v SN ) 
such that, 

log \I + Haja-Hfal < 1 ( Y £ SN ; XE SN \X° SN ) + 0(\V\), (72) 

Proof. We will reduce the jointly Gaussian vector X to a vector that is valid in the DSN 
model and show that the reduction only leads to a 0(|V|) loss in mutual information. 

1. Note that 

log \I + Hq^cH^^cI = I {Hq^qcXq + Zqc; Xq) , (73) 

where X v ~ i.i.d. CA/"(0, 1). Throughout the rest of this proof, we will drop the subscripts 
Q and Q c for convenience. It is shown in the proof of Theorem 4.1 in [2] that if we 
restrict X v to only the fractional part, the loss in mutual information can be bounded. 
We present a quick sketch here for the sake of completeness. Let X v = f X v — [X v ] denote 
the fractional part. Then 

I(HX + Z;X) < I (HX + Z,H[X]) (74) 

< I(HX + Z;X)+H(H[X]) (75) 

< I{HX + Z-X) + Y J H{\X}) (76) 

wen 

< I(HX + Z\X)+±\Q\. (77) 

Note that X v e [-1/2, 1/2). 

2. Next, from [1] (Lemma 7.2), it follows that 

I(HX + Z;X) < I([HX] ; X) + I9\n c \. (78) 

Here [-ff^] corresponds to the output of the DSN when X is the input, however X still 
takes values in a continuous space and is not a permissible input in the DSN. 

3. Since the reduced channel given by Y = [HX] is deterministic and further the output 
Y takes values in a finite set, we can also restrict X v to take values from a finite set 
within [—1/2, 1/2) without any loss of mutual information. Let us call this vector X N , 

J( [HX] ;X) = I( [M DSN ] ; X DSN ). (79) 
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We have therefore showed that 



log \I + Hq^H^I < I (F£ SN ; X^\XEn + 23|V|. (80) 
This completes the proof of proposition. □ 
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